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Abstract 

We consider beam selection using a fixed beamforming network (FBN) at a base station with M 
array antennas. In our setting, a Butler matrix is deployed at the RF stage to form M beams, and 
then the best beam is selected for transmission. We provide the proofs of the key properties of the 
noncentral chi-square distribution and the following properties of the beam selection gain verifying that 
beam selection is superior to antenna selection in Rician channels with any if -factors. Furthermore, we 
find asymptotically tight stochastic bounds of the beam selection gain, which yield approximate closed 
form expressions of the expected selection gain and the ergodic capacity. Beam selection has the order 
of growth of the ergodic capacity 6'(log(M)) regardless of user location in contrast to 0(log(log(M))) 
for antenna selection. 

I. Introduction 

Deploying multiple antennas at a base station dramatically increases spectral efficiency. While multiple- 
input/multiple-output (MIMO) systems require multiple RF chains and elaborate signal processing units, 
Antenna selection has been an attractive solution for multiple antenna systems because only one RF chain 
is required to use the antenna with the highest signal-to-noise ratio (SNR). 

With promise of higher spectral efficiency, we focus on beam selection instead of antenna selection 
using a FBN at a base station which deploys M multiple linear equally spaced omnidirectional array 
antennas when each remote unit is equipped with an omnidirectional antenna. While the base station can 
adaptively steer beams to remote users using M RF chains, we investigate the Butler matrix, a simple 
FBN at the RF stage producing orthogonal beams and requiring only one RF chain for the best beam 
to be selected for transmission HI. The choice of the best beam can be achieved with partial channel 
state information (CSI) at the base station. The remote feeds back the index of the best beam to the base 
station for the forward link. 

Although beam selection has been known to have no advantage over antenna selection in ideal 
Rayleigh fading channels, it has been established (using analysis and simulations) that beam selection can 
outperform antenna selection in correlated Rayleigh fading channels with limited angle spread 0. For 
the case of Rician fading channels, there exist only limited analytical results of two very special cases of 
Rayleigh fading channels and deterministic channels except our own work in while simulations and 
measurements have shown that beam selection using the Butler FBN outperforms antenna selection 131 . 

Motivated by this, we have analyzed the performance of beam selection using the Butler FBN for Rician 
fading channels with arbitrary If -factors and derived the exact distribution of the beam selection gain as a 
function of the azimuthal location of the remote user in our previous work 131, where some key properties 
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Fig. 1. Beam selection system using the Butler FBN with M Unear equally spaced array antennas and 
beam pattern for M = 4 and d = Ac/2. 



of the noncentral chi-square distribution and the following properties of the beam selection gain have 
been presented without any proofs. Using these properties, we have compared the beam selection gain 
with the antenna selection gain for Rician fading channels and analytically proved that beam selection 
outperforms antenna selection. 

In this paper, we provide the proofs omitted in [31, which verify our claim that beam selection is superior 
to antenna selection regardless of user location in Rician channels with any i^-factors. Moreover, we 
find asymptotically tight stochastic bounds of the beam selection gain yielding approximate outage and 
the approximate expression for average performance. This approximation technique can be applied for 
most of average performance measures as shown for the expected selection gain and the ergodic capacity. 
Using these results, we obtain orders of growth of the expected selection gain and the ergodic capacity 
for beam selection, proved to be higher than those for antenna selection. 

The remainder of this paper is organized as follows: In Section JIJ we present our system model when 
the Butler FBN is used in the base station. In Section JIIJ we analyze the beam selection gain using a 
statistical approach. In Section |IVl we compare the gain of beam selection with that of antenna selection, 
and prove that beam selection outperforms antenna selection under any Rician channel transmission 
model. In Section |Vl we find stochastic bounds of the beam selection gain and approximate closed form 
expressions of performance measures. Finally, we provide our conclusions in Section |Vl] 

II. The System Model 

We consider a base station endowed with M > 2 antennas (as depicted in Fig. [TJ and remote units 
each endowed with one antenna. For the m-th port of the Butler matrix (m G {1,...,M}), the SNR 
equals to p ■ Fm regardless of the direction of the communication link 131, where p is the average SNR 
per port and Fm denotes the gain of selecting the m-th port. This gain is given by 

Frn = \blh\\ (1) 

where the M x 1 complex vector h = [hi, Hm]'^ represents the flat fading channel gains for corre- 
sponding antennas normalized such that E[|/ijp] = 1 for i = 1, 2, • • • , M, and the 1 x M complex vector 
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fo^ is the m-th row of the M x M Butler matrix given by 



(2) 



/ML 

Then, the base station chooses the port with the highest SNR. To select the best beam for the forward 
link, the remote user only needs to feed back the index of the best beam to the base station (even when the 
channel is not reciprocal) and this is the only difference between reverse and forward link beam selection. 
From this point on, we will not distinguish reverse and forward link beam selection in this paper as they 
are analytically identical. The SNR is then given by p ■ Ff^j^jy where the notation is used to denote 
the jn-th smallest value from any set of finite samples {zi, zm}, and thus /^(Af) = ^^^mG{i,...,M} ^m- 

We define the beam selection gain as the ratio of the SNR of beam selection with a FBN to the average 
SNR of random antenna switching without a FBN, which is given by /"(Af). 

III. Beam Selection Gains in Fading Channels 

It has been shown that beam selection outperforms antenna selection in ideal line-of-sight (LOS) 
channels, while beam selection performs as good as antenna selection in ideal non-line-of-sight (NLOS) 
channels |:4t We are interested in investigating the performance of beam selection under Rician channel 
models. This is the most frequently used realistic channel model in wireless communications. Under the 
Rician channel model, the normalized channel vector h can be modeled as multipath signals 



The entries of complex vector (which represents the normalized LOS component) are modeled to 
have unit power and fixed phase. The entries of the complex vector h jy (which represents the normalized 
NLOS component) are modeled by i.i.d. independent zero-mean circularly symmetric complex Gaussian 
random variables with unit variance. The parameter K is referred to as the Rician A'-factor, which 
represents the ratio of the LOS signal power to the NLOS signal power. The special cases oi K = oo 
and K = Q represent ideal LOS (deterministic) and ideal NLOS (Rayleigh fading) channels, respectively. 



A. Deterministic Components 

Consider the LOS component Hl. Let 9 denote the azimuthal angle of incident between a LOS signal 
and the line perpendicular to the linear equally spaced array antennas assuming two-dimensional geometry 
(horizontal plane) as shown in Fig. [T] Furthermore, assume that the distance between the base station 
and the mobile user is much larger than array antenna separation. Then for both reverse and forward link 
beam selection, /il is given by 



Hl = exp(jV') 



1, exp 



exp -j27r(M 



d 

-j2ir— sin ( 
Ac 

d . 
Ij-— sm/ 

Ac 



(4) 



where ip is an arbitrary phase shift of the signal from/to the first array antenna, d is the distance between 
adjacent array antennas, and Ac is the carrier wavelength. 

Let the SNR gain of the m-th beam in ideal LOS channels {K = oo) be denoted by 

|2 



r M, 

= < J_ sm''(M(/)„./2) 
I M sm^((/)„/2) ' 



if (/>m = 27rn, 
otherwise, 



n G 



(5) 
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where 



27r 



1 

M 



m 



Xr 



■ sm I 



(6) 



Since ft.^ is a function of 9, 7^ is also a function of 9 and let us call a set of M functions {7m|?7z 
1, M} a beam pattern, which has the following properties: 



M 



^-fm. = M, < 7^ < M; 

m=l 

M if and only if 



n n 
27r — , — E 
M M 



7r7 



if and only if 



2tt- 



n n 



(7) 

(8) 
(9) 



M' M ^ ' 

where the azimuthal angle satisfying ^ is the beam direction. Let us define a lobe of a beam as a mam 
/oZ?e if the beam direction is inside that lobe. We assume 



M- 1 d 

2M ^ Yr 



(10) 



for all M beams to have at least one main lobe. We examine the beam pattern only from = to the 
first beam direction given by 



= arcsm 



1 Xc 
2M d 



(11) 



as discussed in |[3l. 



B. Probabilistic Analysis 

Now, let us consider the statistical channel model including NLOS components. The cumulative 
distribution function (cdf) of Fm is given by I3j 

F^(x) ^ Pr{r„<x} 

= E[F^.(2(i^ + l)x|n + 2P,/2)]t„^2,5=2X^„' (^2) 

where K^'2(x|n, 5) is the noncentral chi-square cdf with n degrees of freedom and the noncentrality 
parameter 6, Ps/2 is a Poisson random variable with mean 6/2, and F-)^2(^x\q) is the chi-square cdf with 
q degrees of freedom, given by 

F,.(.|,) = l-e-^/^'f:'(^ = e-^/^ f; M?)! (13) 

k=0 k=q/2 

if q is an even number as in (f72l) where q = n + 2^5/2 1 n=2- Note that given K, evaluating 7™ is enough 
to know the distribution of the SNR gain Fm- The beam selection gain /^(m) is given by 

M 

F^M){x) = Pr{r(jv/) < :e} = J] (14) 

771=1 

and thus for x > 0, 

M 

logF(A;)(x) = logFm(x). (15) 

m=l 
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We have the following useful key theorem on the noncentral chi-square distribution, whose proof can 
be found in the Appendix. 

Theorem 1: The logarithm of the noncentral chi-square cdf with two degrees of freedom 



\ogF^..{x\2,5) 



(16) 



is a strictly decreasing and strictly concave function of the noncentrality parameter 5 > for any given 
X > assuming that the base of logarithm is greater than one. □ 
Now, we are ready to show the following theorem, where stochastic order relations are introduced in 
iS Ch. 9]. 

Theorem 2: For any given x > 0, the cdf of the beam selection gain /'(^j, is a strictly 

decreasing function of 6 from zero to the first beam direction v = arcsin(2^^). Therefore, in this 
interval, Tj-^/) is stochastically increasing, stochastically smallest at ^ = 0, and stochastically largest at 

e = v. □ 

Proof: This proof is given in the Appendix. ■ 
The corollary below follows naturally from Theorem |2l 

Corollary 3: For 6 G [— 7r/2,7r/2] and any integer \m\ < yj^, /"(m) is stochastically increasing as 9 
increases if 



arcsm 



m Ac 



arcsm I mm 



m + 1/2 Ac 
M ~d 



(17) 



and stochastically decreasing as 9 increases if 

9 e 



. ( fm- 1/2 Ac ^ 

arcsm max < , — 1 

V \ M d 



arcsm 



rri Ac 
M d 



(18) 



(A/) 

z/ are achievable stochastic lower and upper bounds, respectively for /^(a/) with an 

□ 



It is exactly opposite for the other half of the horizontal plane, 9 G [7r/2, 37r/2]. Therefore, /^(a/) with 
9 = and 9 
arbitrary 9. 

Corollary [3] tells us that the expected performance measures over /^(a/) with ^ = and 9 = v can 
serve as lower and upper bounds, respectively, for the averages of any performance measures which are 
increasing functions of SNR, e.g., the channel capacity. They can also serve as upper and lower bounds, 
respectively, for the averages of any performance measures which are decreasing functions of SNR, e.g., 
the bit error rate (BER), applying the result in lH pp. 405-406]. 



IV. Beam Selection Versus Antenna Selection 

Let us consider the antenna selection gain under the same scenario used for beam selection case except 
the fact that the Butler FBN will not be deployed for antenna selection. When the m-th antenna is selected 
among M antennas in the base station, the SNR is given by p ■ Hm, where Hm = |/imP- Assuming that 
the antenna with the highest SNR is always selected, the antenna selection gain is defined as the ratio of 
the SNR of antenna selection to the average SNR of random antenna switching, which can be expressed 
by i^(A/)- For any m, the cdf of Hm becomes 

G{x) = PT{Hm <x} = F^^2{2{K + l)x\2,2K). (19) 
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Therefore, the cdf of -ff(A/) is given by 

G(A/)(x) ^ Pt{H(^m) <x} = G'\x). (20) 

With the proofs of previous theorems, we can confirm that the following lemma holds. 

Lemma 1: For the same Rician ET-factor, beam selection always outperforms antenna selection, i.e., 
the beam selection gain /^(m) is stochastically larger than the antenna selection gain -ff(A/)- □ 
Proof: Applying the concavity result in Theorem [T] and Jensen's inequality gives us 

logG(M)(x) = MlogF^,2{2iK + l)x\2,2K) 

M 
■m=l 

= logF(A/)(x), (21) 
for any given x > 0. ■ 



V. Asymptotic Selection Gains 
It has been shown that the beam selection gain is stochastically upper and lower bounded by 
with 9 of zero and the first beam direction z/ = arcsin (237^)' respectively. Our interest in this section 
is to see how these two extremes change as the number of antennas M increases and then obtain the 
asymptotic selection gain for an arbitrary location of the remote user. Furthermore, these analytical results 
can be applied to study the outage and the ergodic capacity of beam selection systems. For this purpose, 
consider the SNR gain Fmid) and its cdf Fm{-\9) as functions of the azimuthal angle 6. 



A. Bounds and Approximations 

First, we can obtain the stochastic lower bound for the beam selection gain of the user at the beam 
direction /^(a/ ) {i^) given by 

M 

FiM)ixW) = n F^'<'^{K + l)x\2,2K^M) 

m=l 

= F^,2{2{K + l)x\2,2KM) ■ F^i-'^{2{K + l)x\2) 
= Qm{x)W^-\x) 

< Qm{x), (22) 
where Q and W are defined by 

Q^{x) = F^,2{2{K + l)x\2,2K'y), (23) 
W{x) = F^2{2{K + l)x\2). (24) 

Fig. |2] shows F(A/)(x|z^) and its stochastic lower bound Qm. It can be seen that the lower bound Qm 
approaches to the cdf F(a/)(x|z/) as M increases, which will be proved. 

Now, consider the beam selection gain of the user exactly between beams r(Af)(0) and its cdf given 

by 

M 

FiM){x\0) = n Fy2{2{K + l)x\2,2Kj^{0)). (25) 

m=l 
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40 
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Fig. 2. Distributions of the beam selection gain r(jvf)(z/) and its stochastic lower bound for K 
— 10, 0, 10 dB, where d = Ac/2 is assumed. 



Let us choose a vector u = 

71 (0) = 7A/(0) = 



•'^(Af)] which majorizes the beam pattern {7m(0)|?7i = 1, ...,M} as 

1 



u 



(M-l) 



> 



U{M-2) = U(M~3) 



Msin2(7r/2M) 
1 



aM 



M 

T ~ Msin2(7r/2M) 



bM > 72(0) = 7A/-i(0) 



> 73(0), ...,7M-2(0) > 
where majorization is introduced in 16] p. 45]. 



-"(1) = 0, 



(26) 
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Notation: For any two real-valued sequences cm and du, we define 

Cm ^ d-M if and only if limjv/^oo |cm - c^mI = 0; 

Cm ~ dM if and only if limj^/^oo CKi/dM = 1; 

Cm = &{dM) if and only if < liniA/^oo CM/dM < oo. 

□ 

Using this notation, we can see 

4 

AM ~ -^M = (0.4053...) X M, (27) 

6m ~ Q - ^ M = (0.0947...) X M. (28) 

Applying Hardy-Littlewood-Polyas theorem in |f6], pp. 88-91] and the strict concavity of ([T6l ) to (1251 ) 
yields the stochastic upper bound 

M 

F^M){x\0) > H F^,2{2{K + l)x\2,2Ku^m)) 

■m=l 

= F^,.{2{K + l)x\2,2KaM) ■ F^, . (2(K + l)x|2, 2/^6^./ ) • F^i-^{2{K + l)x\2) 

= Qljx)-Qljx).W''~\x). (29) 

Thus, we have the stochastic lower and upper bound for F(^m){x\0) given by 

Ql,{x) > F(M)(x|0) > Ql,,{x) . Qljx) . W^'-\x). (30) 

Fig. [3] shows F(M)(a^|0) and its stochastic lower bound Q^^^ and upper bound Q'^^^Q'^^^W^'^~^. We 
also observe that as the lower and upper bounds are merged into each other, so does F(j\/)(x|0) as M 
increases. 

The following theorem verifies that the stochastic lower bounds in (l22l ) and ( [30l ) are indeed asymptot- 
ically tight. 

Theorem 4: For K > and p £ [0, 1), 

F[^{pW) ^ Ql^hp) (31) 

and 



F, 



(M)(P|0)«Qa2(v^) (32) 



as M increases. □ 
Proof: This proof is given in the Appendix. ■ 
We also have the following theorem useful for average performance evaluation, whose proof can be 
found in the Appendix. 

Theorem 5: Let h be any differentiable function defined on [0, oo) such that h' is bounded. If h is 
integrable with respect to Qm, then 



h{x)dF(^M){xW) ~ / h{x)dQM{x) (33) 
Jo 

as M increases. If h is integrable with respect to Qa^,^ then 

/•oo 

hix)dF^M)ix\0) ^ h{x)dQl^^{x) (34) 
Jo 

as M increases. □ 
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A:=10dB, 8 = 




10 15 
Selection Gain 



Fig. 3. Distributions of the beam selection gain r(j\/)(0) and its stochastic lower and upper bounds for 
K = —10, 0, 10 dB, where d = Ac/2 is assumed. 



Theorems |4] and [5] in this subsection demonstrate that for large M, the distributions of the beam 
selection gain of the user at the beam direction -/^(m) i'^) ^^i^ the beam selection gain of the user exactly 
between beams /^(m)(0) can be well approximated by Qm{x) and Q'^^^{x), respectively, which are the 
noncentral chi-square distribution and its square. These are useful as their closed-form expressions are 
complicated and thus not insightful. 
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B. Performance Analysis 

It can be seen that outage probabilities with = and 6 = v for n given rate Cq can be approximated 

by 

A 



Pontic^) = Pr {log2 (1 + pT^M){e)) < Cq] 



Qm 

QaM 



P 

2^0-1 



if6 = u, 
if 6 = 0, 



(35) 



for large M. Furthermore, Theorem |4] can be used to approximate outage capacities with 6 = and 
9 = u as 



Cout(-Po) 



log2 
log2 



(36) 



"1 + pQm(^o)], if 9 = 1^, 

l + pQaliV^o)], if = 0. 

for large M. 

Let us apply Theorem [5] to the mean selection gain E [P(Af)] by taking h{x) = x. The expected beam 
selection gain for 9 = u is given by 

KM + 1 

xdQMix) = \ =&{M). (37) 



The expected beam selection gain for = is given by 



K + l 



IE[r(M)(0)] 



xdQl^Ax), 



(38) 



as M increases. Although it seems difficult to solve the integration in ( 1381 ). we can obtain upper and 
lower bounds using an inequality in jV] p. 62] because Q^^^ is the cdf of the maximum of two samples 
from Qa^, , whose mean and variance are {Kqm + 1)/ {K + 1) and {2KaM + 1)/ {K + 1)^, respectively. 
These bounds are given by 



Kau + 1 
K + 



|_ 1 r-OO 



KaM + l 1 ^/2KaM + 1 



+ 



K + l V3 K + l 



which yields 



(39) 



(40) 



K + l 

Hence, E [-^(a/)] = 0{M) regardless of user location, which is faster than ©(log M) for antenna selection 

m. 

Lemma 2: Let p > denote SNR. As M increases, the ergodic capacity of the user at the beam 
direction {9 = v) is given by 

KM + r 

K + l 

and the ergodic capacity of the user exactly between beams {6 = 0) is given by 



E [log2 (1 + pr(M){y))] « log2 1 + P 



(41) 



E[log2 (l + />r(A/)(0))] «log2 1 + P 



KttM + I 

K + l 



(42) 

□ 



Proof: This proof is given in the Appendix. 
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10 



15 



20 



25 



30 



M 



Fig. 4. Ergodic capacity versus M for A' = OdB at p = 5dB, where d = Ac/2 is assumed. 



This lemma also yields the order of growth of the ergodic capacity E [log2 (l + ^/^(m))] ~ 0(log(M)) 
regardless of user location, which is faster than 6'(log(log(M))) for antenna selection Fig. |4] shows 
the ergodic capacity and its approximations in (|4TI ) and (l42l ) for SNR p = 5dB. We see that the 
approximations approach the numerically integrated exact values as M increases. 



VI. Conclusion 

We considered beam selection using the Butler FBN at the base station with multiple linear equally 
spaced omnidirectional array antennas. Completing the analysis of the beam selection gain, we provided 
the proofs of the key properties verifying that beam selection is superior to antenna selection in Rician 
channels with any i('-factors. We also found asymptotically tight stochastic bounds of the beam selection 
gain and approximate closed form expressions of the expected selection gain and the ergodic capacity. 
Using these results, it was shown that beam selection has higher order of growth of the ergodic capacity 
than antenna selection. Graphical results were provided demonstrating the underlying gains and supporting 
our approximations. 
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Appendix 

Proof of Theorem |7} Without loss of generality, assume the natural logarithm. For any given x > 0, 
T6l) can be expressed as 



\ogF^.2{x\2,5) = log 
5 



Li=0 

+ log 



-5/2 



{5/2r 



-Oli 



E 

.j=0 



-OLi 



where a, is defined as 



ai = K,2(x|2 + 2i) =e-^'/2 ^ 



fc=j+i 



(43) 



(44) 



from (fT3] ). Differentiating (1431 ) gives us 



logF^..(x|2,5) 



1^1 E£o 



oo (5/2)' 



2 ' 2 W^)' n-- 

2^1=0 il 



V^oo ((5/2)- / 



^oo (5/2)' 



< 



(45) 



2 Ei=0 

for (5 > because a^+i < Oj from (l44l ). and thus ( fT6l) is a strictly decreasing function of 5 > 0. 

Now, prove that ([T6l ) is a strictly concave function of 5 > 0. The second derivative of (1431 ) is given by 



^oo (V2)' . 



I "i+l 



Eoo 
«=0 



4 E 



the i-th order term of whose numerator can be simplified as 

— - — (aoai+2 - aiai+ij- 



(46) 



(47) 



Let us show that (|46l ) is negative by proving that (|47l ) is negative for (5 > 0. Consider aj_i/aj, which is 
an increasing function of i because 



OLi 



v^oo {x/2)>' 
l^k=i+l k\ 

1 

Y^oo {x/2)*' 
l^k=l (i+k)\/i\ 



and {i + k)\/i\ increases as i increases for any positive integer k. Therefore, 



which yields the strict concavity of (1161 
Proof of Theorem |2l' Define 



ai 02 ai+2 



/3 = 27r-^sine. 



Ac 



(48) 



(49) 



(50) 
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Under the condition (fTOb . (3 is an increasing and continuous function of 9 and has the range 
Therefore, we only need to show that F(,„) is a strictly decreasing function of f3 in the domain 
From 



n — 
n — 



^{f3)^ — {m--]-f3 = 4>m{0)-P, (51) 



A 27r / 1 



and by defining 



we can represent 



M if 4> = 27rn, n G N 

M sm^(<^/2) 



V{<^) = { otherwise, ^^^^ 



7^(/3) = (53) 

Note that 7]{(j)) is a periodic function with period 2it, symmetric with respect to the axis (f) = Tin, and 
the value of at = 27rn makes ri{(j)) a continuous function of cj). 

First, prove that for (5 G [O, the beam pattern {7™} can be sorted in nonincreasing order as follows: 

71 > 7A/(/3) > 72(/3) > 7A/-i(/?) > - > (54) 

where [-J is a floor function. It can be easily shown that 

^M+i-M = = 7m(-/3)- (55) 

We get the following equivalent inequalities of (l54l ) 

r?(.^i(/3)) > ??(0i(-/?)) >...>?? (0Lii±iJ ((-l)^-V)) • (56) 

We can see that 
and 

< 0l(/3) < M-P) < - < <PlM±ll < 7T, (58) 

which yields ( [56l ) because in dSTl) . the numerator sin^ {^4') has the same value at (p = ((>m.{±l3) for 
any fixed (3 and all m, and the denominator sin^ {^4>) is increasing function of cf) € [0, vr]. Define the 
nondecreasingly sorted vector 7 from {7m} given by 

7 = [7(i)>7(2),---,7(Af)] 

= 7LiL!.J+i) •••'7A/-i,72,7Af>7i (59) 

for P € [0, Let us show that 7(/32) strictly majorizes 7(/3i) for < /3i < /32 < which means 

M M 

Y.^{i)iPi) = Y.^{i)iP2) (60) 

i=l i=l 

and 

m m 

E^»(^i)>E^»(^2) (61) 

i=l i=l 
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for all m G {1, M — 1}. We already have (l60l) from and thus it suffices to prove (|6TI ). Under the 
assumption that (|6TI ) is proved, using Hardy-Littlewood-Polyas theorem in 161 pp. 88-91] based on the 
strict concavity of (fT6l) proved in Theorem [T] gives us 



M 



logF(M)(x|iC,/?i) = Y,\ogF^,.{2{K + l)x%2K^m{Pi)) 



m=l 



M 



> logK,-(2(A- + l)x|2,2J^7„(/32)) = log F(m)(x|K, /Js), 



(62) 



m=l 



which basically shows that is a strictly decreasing function of p. 

Let us prove that 71 (/3) and ■~fM{l3) are strictly increasing and strictly decreasing respectively. For 
(f) 7^ 27rn, it can be shown that 



Moot — f 



cot ( 



(63) 



We can show /?'((/>) is negative for < (/> < ^ because by the Taylor series expansion, 



M cot - 



M 



2 ^ 2^'\B2i\ ( M 



2i-l 



n °o 92i|R I 
Z Z |i32i| 



2j-l 



oo 22*1^, 



2i\ 



\ (2^)! 



(M' 



2i 



2i-l 



< 



where Bi is the i-th Bernoulli number. Therefore, 77(0) is strictly decreasing in [O, ^] 
strictly increasing in \^tx ^^~j^ , 27r] by the symmetry. Since 

7,(ffl=,(i-^) 



and 



7M(/3)=r/(2^^^^^^-/3), 



(64) 

and thus ?7((/)) is 
(65) 
(66) 



we have proved our claim. 

Now, consider the case when M > 3 and m = 2, ...,M — 1. We can see that if m < (M + l)/2, 
7A/+i~m(/?) is strictly decreasing because the numerator and the denominator in (l52l ) are strictly de- 
creasing and strictly increasing respectively as functions of /?. Moreover, we can show the fact that 
lm{0) +7Af+i-m(/5) is Strictly decreasing, which can lead to the consequence that 7^+1 is strictly 

2 

decreasing for odd M and thus 7M+i-m(/3) is strictly decreasing for m = (M + 1)/2 as well. It suffices 
to prove that 

7:n(/3) + 7M+i-m(/3)<0 (67) 

for /3e (0,^). From dSl]), 

7Af+l— m (/?) 



1 sin^ (M(/)^(/3)/2) 1 sin^ (M(/)^(-/?)/2) 
M sin2 (0^(/3)/2) M sin^ (0^(-/3)/2) 



M 



■ cos 



— /3 ) [csc2 (0„,(/?)/2) + csc2 (</.^^(-/3)/2)] , (68) 
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because it can be shown that 



sin^ {M^m{fi)/2) = sin^ (M<^^(-/?)/2) = cos^ —(3 



M 



By defining 



/(/?) ^ cos^ (^p 



giiP) ^ csc2 {<Pr,^{P)/2) , g2{P) = esc' (</.„(-/3)/2) , 



^ „„„2 



we have the expression 



7 m + 7m+i- 



M 



f {gi +92) 



f 91+92 



Since / > and gi + g2 > for P ^ (O, -fj), we only need to show 



/' , 9[+92 



f 91+92 



< 0. 



Simple derivations give us 



= -Mtan ( y/? 



CSC 



/ 

2 / VmW) 



CSC 



2 / 'rrn. 



cot 



and similarly 



92 = - csc^ 



'Am(-/5)\ V<Am(-/5) 



We get 



= -Mtan( ^/?) + 



CSC 



.(/3) 



cot 



cot 



2 



CSC 



2 f 



cot 



2 



CSC 



+ CSC^ 



2 



Because < i;^m(/9)/2, (^m(— /9)/2 < vr, applying the mean value theorem yields 



CSC 



cot 



.(/3) 



CSC 



2 f^rJr-p) 



cot 



-2 csc^ V cot^ V ~ csc^ V'- 



for some ^ G ((/)^(/3)/2, (/>„(-/?)/2). Then, 



h{(3) = -Mtan ( — /? ) + 



/3csc^V'(2cos2'0 + 1) 



CSC 



2 



+ CSC^ 



2 / -^^("/3) 
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We can see that for ■0 G [(/).m(/5)/2, /?)/2], csc^ 0(2 cos^ 0+1) has maximum at either ■0 = 0m(/3)/2 
or ip = 0m(— /3)/2 and let it be denoted by ipi,. We are ready to show the following series of inequalities 



h{l3) < 



* 2 CSC2 0fe + 1 



< 



< 



-2 + 



10 



3 — cos 2'ipb 



M2 
M2 



< — + 3^ 



/3 + 3/3 csc^ ipb 
1 



,2 ji_ 

M 



< 



M 



,f2 



■/5 



-1 + 



< 0, 



(80) 



where the last inequality holds as M > 3. This proves ( 173] ). and thus ( [671) follows. 
It is clear that Yl^Li 7(j)(/3) strictly decreasing for all m G {1, ...,M — 1} because 

7M+l-m(/3) (81) 

and 

7m(/3)+7A/+l-m(/3) (82) 

are strictly decreasing for m = 1, [^^^^^J , which we has been proved above, and Yl^i 7(i) iP) becomes 



either the sum of (1821 ) for multiple m or the sum of (1811) and (1821 ) for multiple m. The validity of (162b 
completes our proof. ■ 
Proof of Theorem^ All functions in (|3T]) and (l32l ) take the value if and only if j; = 0. Thus, we 
can assume p G (0, 1). To show (OTI ). define xi = Q]^}{p) and X2 = F^j^{p\u) and this yields 

Qm{x2)W^'~Hx2) 



Qm{xi) = ^M[.X2)W' -[X2) =P 

Let us introduce a new variable X3 to obtain upper bound for X2 given by 



Qm 



P 



> X2 > Xi. 



(83) 



(84) 



,VF*^-i(xi), 

Let us show x^ ^ xi, and then X2 ~ xi in (|3T]) follows immediately. The value of xi can be computed 
using Sankaran's approximation in ||9l, where it has been suggested that for a random variable X with 
the cdf Fy.r2{x\n, 6), {X - {n - l)/2}^/^ - {5 + {n - l)/2}^/^ is approximately zero mean Gaussian 
with unit variance and this approximation improves if either n or 6 increases. Thus as M increases, 

^ 2" 



1 



xi 



2{K + 1) 
K 




2KM + - 
2 



+ cD-i(p) 



-M, 



K + l 

where is the inverse function of the Gaussian cdf given by 

1 



^>(x) 



dt. 



(85) 



(86) 



SUBMITTED TO THE IEEE TRANSACTIONS ON INFORMATION THEORY 



17 



Let us use the notations fxp and to denote the mean and variance of distribution F, respectively. 
Then, it can be shown that 

QM-l + C 2 _ ^^/6 



2{K + 1) 



I 2 ' 



(87) 



{2iK+i)r 

where qm-i = W"^ (1 - 1/(M - 1)) w ln(M - 1) and C = 0.5772... (Ruler's constant) HI. For 
xi > fiwM-i (this is true for all M > C for some C), applying one-sided Chebyshev's inequality in ifTOl 
p. 152] yields 

l-W^^-\xi)< ^ 



and thus 



where 



1 < 



< 1 + CM, 



< CM 



W^-^{xi) 



v2 



(88) 



(89) 



(90) 



by ([85] ) and ([87l ) as M increases. We have 



X3 - Xi 

< Qll{p{l + eM))-Qllip) 

, 7 K M 4- 
K+l\ 

1 



+ 



^— (2KM + i j ' + sm)) - ^-\p)} 



2{K + l 
1 



0, 



(91) 

as M increases, because {2KM + 1/2)^^^ • eA/ ~ from (|90l ). Hence, (|3T] ) is proved. Now, (l32l ) can be 
shown similarly. For any p G (0, 1), let us define X4 = Qa^iVP) ^^'^ ^5 



VP 



>X5>i^(M)(p|0)>X4. 



From (I30ll . defining xq yields 

Assuming xq X4, we have F^j^j^{p\0) w X4 in (l32l ). Now, as M increases, it can be shown that 

K 



X4 



K + 1 



(92) 



(93) 



(94) 
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as above. Note that Qb,jW 2 is the distribution of the maximum of two independent random variables 

A/ -4 

following Qhj^j and W ^ . it can be easily proved that 

Kb&i + 1 qM_± + C 

K + l ^2{K + l) 
K 



K + l 



hu (95) 



and 



2 22/*^ 

CT AJ-4 < (Tri + <7 Af-4 + /i„ Af-4 

KbM + l vrVe /qm^+CV 

(K + 1)2+{2(K + 1)}2 + |^2(K + 1)J 



Once again using one-sided Chebyshev's inequality, 



bM. (96) 



1 ^ , / . < 1 + ^M, (97) 



2 

M -4 

U < — 



where 



X4 — U M-4 

1 l/2-4/7r2 1 
^(8/^2 _ 1/2)2 M 

= (0.9820...) X (98) 

As If increases, this implies {2KaM + 1/2)^^"^ ■ e'j^ k, 0, which leads us xg ~ X4 as in ( |9T] ). ■ 
Proof of Theorem |5} Let us show (l33l) . first. Let Xj? denote a random variable following any 
distribution F. Obviously, -^(a/) (^) is stochastically larger than Xq^^ from (l22l ). Using the idea of coupling 
E Sec. 9.2], define 

^(A/)M=^(M)(QM(^Q„)k). (99) 

Then, r^^j^i-^{u) and r^^^~^{v) share the same distribution but r^^j^{v) > Xq^,^ with probability L By the 
mean value theorem, we have 



HriM)i^)) - KXqJ = h'{e) ri,,){u) 



Xr 



(100) 
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for some e £ (^Xq^, , T^j^j-^iu)^. Using this, 

|E[/i(r(M)M)-M^Q.J] 



= E[hiri,j^{,.))-h{XQj 

< E[|/i(r(*,,)(i/))-MXQ„) 

< c-E[|r(*A,)(z.)-XQ„|" 
= c-\K[r^M){i^)-XQ,,]\, 



(101) 



where \h'\ is bounded by C. Now, let us show E [-^(a/)(^)] ^ ^ [^Qm]- For ^^ly x > 0, 

Qm{x) > F^M){xW) = Qm{x)W^'~\x) 



> 



[Qm{x) + W 



M-l, 



1 



(102) 
(103) 



where \-r is defined as 

y if y > 0, 

if y < 0. 

As Q]\f + W^^~^ — 1 is an increasing and continuous function of [0, oo) onto [—1, 1), there exists only 
one a > such that 

QM{a) + W^^-\a) -1 = 0. (104) 



Therefore, 



'y) - Xr 



< E [r(M) 

l-OO 

= / [{l-F^M){x\y))-{l-QM{x))\dx 
Jo 

l-OO p 

< / Qm{x) - [Qm{x) + W'^'-^x) - 1] 

ra poo 

= / QM{x)dx+ [l-W^^~^{x)]dx 

Jo Ja 

rf3 roo 

< / QM{x)dx+ [l-T^^-^(x)]dx 
JO J a 



dx 



(105) 



for any > as (11051 ) can be minimized by choosing /3 = a. Let us obtain the upper bound for the first 
term of ( 11051 ) using the Marcum Q-function defined and bounded as 



xe^''"+''"^/^Io{ax)dx 



> 1 



■ exp 



--(a -6)2) ifa>6, 



(106) 



where Io{x) is the modified Bessel function of the first kind with order zero fTT|. Using the connection 
between the Rice distribution and the noncentral chi-square distribution with two degrees of freedom, it 
can be shown that 



QMix) = F^,2{2{K + l)x\2,2KM) 

= 1 - (V2KM, ^/2{K + l) 



(107) 
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From (11061 ) and (11071 ). the first term in (11051 ) is bounded as 



/3 



< 



QM{x)dx<l3QM{l3) 



l-^{K + l)(5/{KM) 
for (3 < KM/{K + 1). If we take p such that 



exp 



-KM 1 



K 



KM 



— < — — -. 

M^oo M K + 1 



(108) 



(109) 



then (11081 ) goes to zero. Consider the second term of (11051 ). Note that W is the exponential distribution, 
which has an increasing failure rate (IFR) |[T2l Sec. 3.2]. From the chains of implication in |[T2l p. 159], W 
is a new better than used (NBU) distribution, which is closed under the formation of coherent systems 
including parallel systems, and thus the distiibution W^^~^ is a new better than used in expectation 
(NBUE) as well as NBU. Using the bound for NBUE in Oil p. 187], the second term in (fTOSl ) is 
bounded as ^ 

[ [l-W^^^-\x)]dx < nw^'-^e-'^/''^"-\ (110) 



Note 



ln(M - 1) + C 



(111) 



2{K + l) 

from (|87] ). and thus we can find a sequence f3 such that dllOl ) converges to zero as M increases while 
limA/^oo P/M < K/{K + 1), e.g., P = 0.5KVM /{K + 1). We now prove dSl)- It can be seen that 

Qljx) > F^M){x\0),Qljx)Qljx) 

> Qljx)Qljx)W''-\x). 

By the similar reasoning as above, it needs to be proved that 



E[^(M)(0)] «IE 
as M increases. We can easily show E X, 

E Xn 2 



Q2 Q2 



E 



^ E 
Xn2 



(112) 
(113) 



Xr 



as above. Assuming 



(114) 



yields E 



Xr 



E 



Xr 



this proof. For this, we need to find a sequence /? > such that 

/■/3 POO 

/ Ql^{x)dx+ / [l-Qljx)]dx^O 
Jo J 13 

as M increases. To make the first term of (II 151 ) diminish, /? can be chosen as 



, and thus (11131 ) follows. Hence, we will show (1114b to complete 

(115) 



lim < . 

M -»oo OA/ K + 1 



(116) 
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As Qbj^^ is the noncentral chi-square distribution with two degrees of freedom, Qb^^ is IFR |[T3l . and thus 
Qf^^ is NBUE as above. From the definition of NBUE in |[T2l p. 159], the second term of (11151 ) is upper 
bounded as 



poo 

/ [l-Qljx)]dx < f^QiJl-QlM 
For a < b, Marcum Q-function is upper bounded as lITTI 



b — a \ 2 



(117) 



(118) 



Then, (II 171) can be further bounded as 

[i-QlM dx 

KbM + 1 1 



< 4- 



K + 1 l-l/y/{K + l)(3/{Kb 



: exp 



Kb 



M 



which goes to zero if we take /3 such that 

K 



^ < lim - — < oo. 

K +1 A/^oo bM 



(119) 



(120) 



From the growth rates of om and b^j in (l27l ) and (|28] ). (3 can be selected such that dl 161 ) and ( 11201 ) are 
satisfied simultaneously, e.g., f3 = 0.25 • KM/{K + 1), which proves (11141) and (l34l ) consequently. 



Proof of Lemma^ Obviously, log2(l + px) is integrable with respect to Qm and Qajj 

E [log2(l + pXq„)] < log2 (1 + PMQ„) 



and 



E 



log2(l + PXqiJ\ < log2 (^1 + PPQ 

by Jensen's inequality. From these and Theorem [51 we have 

E [log2 (1 + pr(^M){i^))] ^ E [log2(l + pXqJ] 

and 

E [log2 (1 + pr(M)(0))] ^ E [log2(l + pXq.^^ ) 
as M increases. Then, we need to show that 

E [log2(l + pXqJ] « log2 (1 + PPQ,,) 

and 



E 



log2(l +/3Xq2 



log2 {1+PPQ2 



(121) 

(122) 

(123) 
(124) 

(125) 
(126) 
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as M increases. Assuming that these are true, (|4T]) and (l42l) follow naturally from (l37l) and ( [39l ). We will 
now prove (I125I ). By Chebyshev's inequality, for any given e > 0, we have 

E[log2(l + pXQ,J] 

log2(l + px) dQuix) 



> 



> 



log2(l + WfQ„) 
log2(l + P/iQ,J 
log2(l + 



l-Pr^log2fi^^')<-- 
1 _ Pr ^ -^Af - PtQ,, ^ AtQM + 1/P ("-^ _ 2-e/2 



1 



o"Qa 



(l-2--/2)(^Q„ + l/p) 



> log2(l + P/^Q J - I - log.(l + P^Q J _ 2^./2"(;^^^ + 1/^) 



(127) 



> log2(l + p/iQ„) - e 

for large enough M because fiq^^, = {KM + 1)/{K + 1) given in (137]| and cta/ = ^/KM + 1/{K + 1), 
which proves (I125I ). Moreover, (1126b can be shown similarly as /xq2 > hq^^^ = {Kgm + + 1) 

and a(52 < V^ag^^ = yj2{2KaM + ^)/{K + 1) by the variance bound in 111 p. 69] ■ 
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